Using an Ontology to Determine English Countability

نویسندگان

  • Francis Bond
  • Caitlin Vatikiotis-Bateson
چکیده

In this paper we show to what degree the countability of English nouns is predictable from their semantics. We found that at 78% of nouns’ countability could be predicted using an ontology of 2,710 nodes. We also show how this predictability can be used to aid non-native speakers to determine the countability of English nouns when building a bilingual machine translation lexicon.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcing English Countability Prediction with One Countability per Discourse Property

Countability of English nouns is important in various natural language processing tasks. It especially plays an important role in machine translation since it determines the range of possible determiners. This paper proposes a method for reinforcing countability prediction by introducing a novel concept called one countability per discourse. It claims that when a noun appears more than once in ...

متن کامل

Countability and Number in Japanese to English Machine Translation

This paper presents a heuristic method that uses information in the Japanese text along with knowledge of English countability and number stored in transfer dictionaries to determine the countability and number of English.noun phrases. Incorporating this method into the machine translation system ALTJ/E , helped tO raise the percentage of noun phrases generated with correct use of articles and ...

متن کامل

A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon

The present paper describes the current release of the Bochum English Countability Lexicon (BECL 2.1), a large empirical database consisting of lemmata from Open ANC (http://www.anc.org) with added senses from WordNet (Fellbaum, 1998). BECL 2.1 contains ≈ 11,800 annotated noun-sense pairs, divided into four major countability classes and 18 fine-grained subclasses. In the current version, BECL ...

متن کامل

Mass counts in World Englishes: A corpus linguistic study of noun countability in non-native varieties of English

Research on the morpho-syntax of non-native varieties of English has reported a widespread presence of mass noun pluralization such as baggages, equipments and softwares. In this paper we conducted a corpus linguistic study in order to provide empirically substantiated answers to this claim. We examined the purported prevalence of noun countability in World Englishes in a 1.9 billion-token mega...

متن کامل

Learning the Countability of English Nouns from Corpus Data

This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of each noun onto a feature vector, and uses a suite of memory-based classifiers to predict membership in 4 countability classes. We were able to assign countability to English nouns with a precision of 94.6%.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002